1 Introduction

There is a long history in astronomy of considering the likely birthplace of the Solar System (e.g., [1,2,3,4,5,6,7,8,9]). Previous reviews (e.g., [10, 11]) have considered the current features of the Solar System to infer the required birth conditions that lead to the present configuration. While the details are still debated, several key features are clear: The Solar System was likely born in a stellar cluster with at least one high-mass star that provided short-lived radioactive isotopes, and the cluster density was modest to prevent tidal effects disrupting the natal disk which would be inconsistent with the observed size of the Kuiper Belt. Taken together, these considerations suggest that the Solar System formed in an Orion-like cluster (see discussion in [10]).

One of the early clues was the presence of short-lived radioactive isotopes discovered in Solar System meteorites, in particular \(^{26}\)Al and \(^{60}\)Fe. The short radioactive decay times for both of these elements (half-life of 0.717 Myr and 2.62 Myr for \(^{26}\)Al and \(^{60}\)Fe, respectively) require that the Solar System formed near a source of these elements in both time and space. As reviewed recently by [12], the timescale is one of the most stringent limitations for how these elements were introduced into the Solar System. Studies of the pathway and likelihood of enrichment naturally led to the broader question of how often Earth-like planets (Solar System-like systems) form. This question is the subject of renewed interest as the number of known exoplanets continues to grow.

Surveys continue to reveal a huge diversity in exoplanet properties (see, e.g., [13] for a recent review). New missions and facilities continue to expand the sampled parameter space, probing lower-mass planets and larger separations (\(>1\) AU) from host stars. A model of planet formation that can reproduce the observed diversity requires considering the range of environments in which most stars and therefore most planets are born.

Making the direct connection between the observed demographics of exoplanets and their birth environments is difficult as most exoplanets are observed around mature stars. Few forming exoplanets have been directly detected in protoplanetary disks [14, 15]. Exoplanet systems have been detected in young open clusters, but with typical ages \(>100\) Myr (e.g., [16]), which is long after planet formation has completed.

While the community works to close this gap, plenty of progress can be made by considering the impact of the large-scale birth environment on exoplanets. Such outside-in models start with the observed characteristics of star-forming environments and model their impact on planet-forming disks. Many of the large-scale characteristics to consider for exoplanet formation are similar to the questions posed for the Solar System. This includes quantifying how feedback from the star-forming environment affects disk lifetimes and the architecture of the resulting planetary systems via external photoevaporation and tidal interactions (including stellar flybys). Enrichment with short-lived radioactive isotopes may play an important role in regulating the water budget of Earth-like (terrestrial) planets [17]. Connecting these things to exoplanet demographics requires considering the diversity of star-forming regions as planet-forming environments.

Recent years have seen significant observational investment in nearby low-mass regions where high-resolution studies of many low-mass stars are possible (see, e.g., reviews from [18, 19]). However, observations of the Milky Way and other galaxies suggest a cluster mass function of the form \(dN/dM \sim M^{-2}\). This means that most stars (\(\gtrsim 50\)%) form in regions forming one or more high-mass stars—star-forming complexes as massive (or more) than the Orion Nebula Cluster (ONC), itself a respectable 2000 M\(_\odot \). Higher-mass regions may have more—and more intense—ionizing feedback. Newly synthesized elements released in the deaths of high-mass stars, including short-lived radioactive isotopes, may pollute nearby planet-forming systems. More stars in high-mass regions may live in high-density environments. Finally, the large-scale characteristics of star-forming environments are not static. Understanding the evolution of star-forming regions is key to quantifying the range and impact of the local conditions a typical star experiences over its lifetime.

The density of the local star-forming environment is one of the key parameters that regulate local conditions for star- and planet-forming systems. The impact of external illumination from nearby high-mass stars depends on distance as \(r^{-2}\). Tidal interactions have a steeper \(r^{-3}\) dependence. Relatively modest changes in the typical distances between stars can therefore have a significant impact. Regions with high densities have shorter dynamical times and thus erase substructure and mass segregate more quickly. Lower-density regions will retain their substructure for longer, and the high-mass stars will not necessarily be centrally concentrated or clustered together at all.

Density is not static, and the density evolution of the region is key to determining which feedback mechanism dominates at a given time. On a global scale, dynamical evolution affects the amount of time that planet-forming disks are in a star-forming region. Determining disk lifetimes, especially in feedback-dominated regions, and comparing them to the timescale for planet formation is key to determine how important external feedback in shaping the outcome of planet formation.

Gas is an important component of the mass budget and the evolution of star-forming regions. In addition to impacting stellar dynamics by changing the gravitational potential (if, for example, gas is rapidly expelled from young clusters by feedback from winds and supernovae), the remnant gas in the region can protect disks from the radiation of nearby high-mass stars. Direct enrichment of molecular gas has also been proposed as an alternate pathway to enrich planet-forming material with short-lived radioactive isotopes (e.g., [20, 21]). Determining how long gas remains in region and its star-forming potential is therefore an important constraint.

In this review, we consider the large-scale environment of the star-forming complexes where most stars are born and how environmental influences affect the observed demographics of exoplanets. Dynamics in young star-forming complexes regulate the direct impact of photoevaporation and gravitational interactions—each of the topics that get their own chapters in this volume. In Sect. 2, we start by establish a working definition for ‘clustered’ star formation to frame how that environment affects planet formation. We discuss the dynamical evolution of clusters in Sect. 3 and simulations/models of the evolution of star clusters and how cluster evolution impacts planet formation in Sect. 4. In Sect. 5, we review recent observational results on the evolution of star clusters and the impact of feedback on populations of stars/disks in clustered regions. We review efforts to connect observed exoplanet demographics with environmental effects in Sect. 7 before discussing upcoming facilities and future research directions in Sect. 8. Finally, we provide a brief summary in Sect. 9.

2 ‘Clusters’ as short-hand for ‘stars forming near each other’

The claim that ‘most stars form in clusters’ is often repeated, but there is not a single agreed-upon definition for what constitutes a cluster. Within astronomy, different communities adhere to different definitions and methods depending on whether the goal is to identify systems that are gravitationally bound or those that are statistically clustered, as identified by algorithms like Density-Based Spatial Clustering of Applications with Noise (DBSCAN) [22].

The most common reference for the claim that ‘most stars form in clusters’ is the seminal review of [23]. From a census of star formation in the Solar neighborhood (within \(\sim 2\) kpc of the Sun), they argue that 90% of star formation happens in clusters. This definition includes both bound and unbound systems. [23] define a cluster as a group of stars that: (1) are physically related; (2) would be stable against tidal disruption if in virial equilibrium; and (3) have enough members so that evaporation time (i.e., when all members ejected by internal stellar encounters) is \(>10^8\) years (the lifetime of open clusters). These criteria are met for a collection of \(N \ge 35\) stars or cluster with a stellar mass density \(>1\) \(\text {M}_{\odot }\) pc\(^{-3}\). For comparison, the average stellar mass density in the Solar neighborhood is \(\sim 0.02\) \(\text {M}_{\odot }\) pc\(^{-3}\) (assuming an average stellar mass of \(\sim 0.2\) \(\text {M}_{\odot }\) and using the observed number density of \(\sim 0.1\) pc\(^{-3}\); [24]).

Identifying whether a collections of stars is gravitationally bound is difficult to establish observationally (e.g., Wd2; [25, 26]). The situation is even less clear in the star-forming stage. The surface density distribution of young stellar objects (YSOs) in nearby (\(<500\) pc) regions shows a smooth distribution [27], implying that there is no bimodal distribution to star formation, e.g., ‘clustered’ versus ‘isolated,’ but rather that star formation results from a continuum of densities.

For planet formation, the key is that stars form in close proximity to other stars. Most stars form in regions with higher densities than the field [28]. This will affect the incident radiation field (e.g., [29]) and whether planet-forming disks are close enough to a source of short-lived radioactive elements to be enriched (e.g., [5, 30]). Whether the system remains bound beyond \(\sim 10\) Myr is less important as planet formation is largely done by this point, although dynamical interactions after this time can alter the architectures of planetary systems (e.g., [31, 32]). Finally, while 90% of stars form in clusters in the [23] definition, only \(\le 4-7\)% of these survive as bound clusters after the embedded phase. Clusters are either unbound during their formation (e.g., [33]) or were not bound in the first place (e.g., [34]). The dissolution of previously clustered regions is a subject of great interest to connect birth conditions to demographics of exoplanets (and binary stars) observed in the field.

For this review, we consider stellar clustering in the statistical sense that stars form near other stars. This includes both bound and unbound systems. We consider how varying degrees of clustering or, more accurately, how different densities in clustered environments affect planet formation. At modest densities of a few stars pc\(^{-3}\), aggregates are a factor \(\sim \)10 more dense than the field [24]. At these densities, external photoevaporation already affects disks [35, 36]. At densities an order of magnitude higher, \(\sim \)100 stars pc\(^{-3}\), planetary systems are disrupted by dynamical interactions [4, 37]. At higher densities of \(\gtrsim 10^4\) stars pc\(^{-3}\), dynamical encounters truncate disks (e.g., [38, 39]). Additional considerations such as the virial state of the cluster or degree of substructure play a smaller role than density.

The cluster mass function determined from observations of the Milky Way and nearby galaxies is of the form \(dN/dM \sim M^{-2}\) [23, 40,41,42,43,44]. The same form of the cluster mass function is found for small and large clusters although these are measured in separate surveys with samples that do not overlap (and include both bound and unbound clusters; see discussion in [45]). Nevertheless, if we assume the same normalization, this implies that 1/2–2/3 of stars form in clusters larger than the ONC (as pointed out by [46]). These regions typically host one or more high-mass stars (\(\ge 10\) \(\text {M}_{\odot }\)). This proximity to high-mass stars leads to external photoevaporation of protoplanetary disks and makes possible the enrichment of planet-forming material with short-lived radioactive isotopes synthesized and ejected in the death of high-mass stars [47]. Even a single high-mass star can have a big influence—recall that luminosity scales with mass as \(L \propto M^{3.5}\) (although note that this flattens to \(L \propto M\) for masses \(\gtrsim 35\) \(\text {M}_{\odot }\); see e.g., [48]). Both external photoevaporation and enrichment can have a significant impact on planet formation even in regions that are not gravitationally bound.

While high-mass star-forming complexes like the Carina Nebula are relatively rare, the sheer number of stars that they form is comparable to the combined contribution of lower-mass regions. A single Carina-like region is forming more stars than nearby regions like the Gould Belt clouds combined [49, 50]. Note that a cluster mass function of the form \(dN/dM \sim M^{-2}\) implies that there are more low-mass clusters than high-mass, consistent with \(\sim 1/3 - 1/2\) of stars forming in low-mass regions. Existing observations overwhelmingly target these low-mass regions, but these conditions are only ‘typical’ for about half of star- and planet-forming systems. For this reason, we argue that more attention should be paid to low-mass stars—the most likely planet hosts—forming in high-mass star-forming regions. These conditions are not sampled in relatively quiescent nearby regions (although feedback can play a role there too, see e.g., [51]). Sampling high-mass star formation beyond the Orion star-forming complex requires pushing to the larger (\(>1\) kpc) distances required to probe these ‘more extreme’ environments. As we discuss in Sect. 8, upcoming facilities will make these kinds of observations routine.

If cluster disruption and dispersal were independent of mass, then high-mass clusters would be the dominant contributors to the field population of stars—where most known planets have been found so far. However, tidal stripping from the Galaxy depends on environment and will affect low-mass clusters more strongly [52], so the contribution of different star-forming regions to the field population may be biased to lower-mass regions.

Finally, a small fraction of stars are born in and remain in long-lived, gravitationally bound clusters. The most extreme examples of high-density, gravitationally bound clusters are globular clusters. In these more extreme regions, planet formation may be altered or suppressed altogether by the same mechanisms that make planet formation a challenge in less dense clusters—disk destruction by external irradiation and dynamical encounters (e.g., [4, 53]). Planets that do form are likely to be dynamically ejected (e.g., [31, 54,55,56,57]). The more extreme conditions in globular clusters may allow alternate pathways for planet formation altogether, such as gravitational instability in a circumbinary disk around evolved stars [58, 59]. However, only one planet has been detected in a globular cluster (in M4—one of the first exoplanet detections; see [60,61,62,63,64,65,66,67]) and no transiting exoplanets were detected in the globular clusters 47 Tucanae [68, 69] or NGC 6397 [70]. It is unclear whether the apparent paucity of exoplanets in globular clusters reflects suppressed formation due to intense stellar feedback, the lower metallicity of globular clusters (planet occurrence is positively correlated with stellar metallicity, [71, 72]), or dynamical ejection of planets in these dense systems. All of these factors likely contribute to the low frequency of exoplanet detections in globular clusters. Less clear is the fraction of planetary systems that are born in globular clusters in the first place. Milky Way star-forming regions do not sample the low metallicities nor the high masses and densities typical of globular clusters, so there are few observational constraints on the birth conditions. Extrapolating from Galactic clusters may give misleading estimates if exotic pathways, such as formation in the circumbinary disk around an evolved star, contribute significantly. On the observational side, the statistical evidence for a lower planet incidence in globular clusters compared to the field is not conclusive [73]. However, these challenging environmental conditions limit the likelihood that any planets in globular clusters are habitable planets [74], so we do not consider them further in this review.

3 The dynamical evolution of star clusters

As discussed in the previous section, density is a key parameter that determines how the clustered environment affects planet-forming systems. An example of the density dependence of different feedback mechanisms is provided in [75] (Fig. 1). [75] compared real cluster environments with the expected impact of external photoevaporation and tidal truncation. Radial averages in several famous star-forming clusters sample roughly 10 orders of magnitude in cluster density. Results from this work (and earlier studies, e.g., [10, 76]) established the density at which tidal interactions become important (\(\gtrsim 10^4\) pc\(^{-3}\)) and demonstrated that external photoevaporation affects disks at much lower densities, \(\sim \)few pc\(^{-3}\). As a result, external photoevaporation impacts a much larger fraction of stars. [75] conclude that, where present, external photoevaporation is a dominant effect, even in regions with sufficiently high densities for tidal truncation to play a role.

Fig. 1
figure 1

Density is one the central determinants of the role of feedback in star/planet formation, as illustrated by these figures from [75]. Left: external photoevaporation begins to impact cluster members at lower densities than tidal interactions. Right: radial-averaged densities of high-mass star-forming regions shown on the same scale for comparison. \(\copyright \) 2018 The Author(s) Published by Oxford University Press on behalf of the Royal Astronomical Society. This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/about_us/legal/notices)

The density of star clusters is not constant. How density evolves regulates when various feedback mechanisms have their strongest impact on planet-forming systems. [36] showed that photoevaporation has a stronger impact on regions that are initially low density but are undergoing collapse to higher densities. In contrast, [39] argue that cluster dynamics are most important for determining disk sizes through dynamical interactions. Density evolution in the first \(\sim 10\) Myr will have the strongest impact on planet formation. Subsequent density evolution may enhance the rate of dynamical encounters that will sculpt the architecture of exoplanet systems. Understanding density evolution is therefore key to constrain which forms of feedback are important at which time[s].

Based on the high observed fraction of stars born in clusters reported by [23] but the low fraction of bound clusters that remain after <10 Myr, a mechanism is needed to unbind clusters if they are born bound. [33, 77] proposed that rapid expulsion of gas would lead to a sudden density drop that would unbind the cluster, leading to expansion and dissolution. In this case, loose OB associations are the expanded remnants for formerly bound clusters. There are a few challenges to this picture, including the timescales involved. Feedback mechanisms like supernovae which provide the impulse to rapidly remove remaining cluster gas happen late (\(\gtrsim 10\) Myr) compared to the observed dispersal time of clusters (<10 Myr). Observations of the gas reservoir, particularly in high-mass star-forming regions, increasingly indicate that gas exhaustion not gas expulsion ends star formation (e.g., [78, 79]).

More recent work on the structure and kinematics of OB associations demonstrates that they are not the expanded remnants of previously high-density clusters. Two structural parameters indicate the degree of dynamical processing: mass segregation and the degree of substructure. [80, 81] showed that substructure only ever decreases. Highly substructured regions are therefore dynamically young. [82] find that lack of mass segregation characteristic of unbound regions, whereas mass segregation tends to increase in bound clusters. [83] demonstrated significant spatial substructure and minimal mass segregation in Cyg OB2—both indications that the region is dynamically young. This was confirmed with subsequent kinematic analysis. [84] find no evidence for global expansion in Cyg OB2 but significant kinematic substructure in the region that appears to be close to virial equilibrium. Both of these facts are in contradiction to the expectations of gas expulsion.

Since the work of [83, 84], Gaia has provided proper motions for kinematic analysis of a large number of star-forming regions. Studies of young star-forming clusters [85] and young open clusters [86] generally find that clusters are expanding. [86] find that the observed expansion is slower than expected from simulations of gas expulsion [33]. [87] show that the type of expansion observed by [86] could be a signature of cool collapse, with the virial state of the region fluctuating around equilibrium, but “freezing in” a supervirial velocity dispersion. [88] note that the lack of correlation between radial velocity and radius argues against expansion from a previous high-density state.

The interpretation of kinematic data remains debated, and discrepancies between radial velocity and proper motion data are observed (see discussion in e.g., [86]). Both expansion and collapse have been claimed in some regions and, in some cases, from the same data. [85] found evidence for mild expansion of the ONC using Gaia DR2 and no evidence for hierarchical assembly in the 28 clusters they consider. In contrast, [89] interpret their radial velocity data of the ONC to be consistent with models of cold collapse. Similarly, [90] find that the mass-dependent proper motion velocity dispersion seen in NGC 6530 is consistent with expectations for a cluster assembling hierarchically, in contrast to [85] who find a radially dependent expansion velocity. Breaking degeneracies between these interpretations requires combining analysis of spatial structure, 6D kinematics (x, y, z, v\(_x\), v\(_y\), v\(_z\)), and models (see [87]).

Furthermore, Gaia data have been used to constrain the initial densities of star-forming regions by determining the number, and velocity distribution, of runaway stars (stars moving faster than 30 km s\(^{-1}\)) and slower, walkaway stars (stars moving >5 km s\(^{-1}\)). Before Gaia, walkaway stars were impossible to trace back to star-forming regions, but many more walkaway stars are produced than runaways as their production rate depends on stellar density [91]. These new results from Gaia corroborate the earlier constraints on the initial conditions of star-forming regions from spatial structure and mass segregation [92].

Results from Gaia have been transformative in many ways, but because it is an optical mission, it is limited to mostly unobscured (and therefore older) regions. However, as [36] and others highlight, the initial density is a key parameter regulating how the larger environment affects planet-forming disks. Probing younger regions and identifying initial conditions is an important direction for the future. Upcoming IR-optimized facilities (see Sect. 8) will help. The sensitivity and angular resolution of existing facilities like the Atacama Large Millimeter/Submillimeter Array (ALMA) already permit studies of embedded low-mass stars in high-mass regions (e.g., [93,94,95]). Much less is known about the initial distribution of low-mass sources in high-mass regions compared to the well-studied local clouds, but this is key to constrain the impact of feedback in the early stages.

Focusing on earlier stages also requires considering the coevolution of stars and gas. As an example, consider the Orion star-forming complex. Gas kinematics suggests large-scale filament collapse is feeding Orion A [96, 97]. This is broadly consistent with gravity-only simulations of cold collapse from [98, 99] and observations of the radial velocities of stars in the ONC [89, 100]. Within the ONC itself, there is evidence for multiple populations [101, 102] with the youngest stars concentrated in the cluster center. [103] proposed that this may explain the ongoing presence of disks in the ONC despite their high observed photoevaporation rates (for which disks in a static, single-age cluster would already be completely destroyed; see Sect. 5 for further discussion).

Core-halo age gradients with younger stars concentrated in the center of clusters have also been suggested in the ONC and NGC 2024 using an X-ray chronometer [104]. Using this method, age gradients in several star-forming regions have been suggested [105, 106]. Such core-halo age gradients are a prediction of hierarchical collapse models [107, 108].

We use Orion to illustrate how the large-scale evolution can affect disks in star-forming regions—in this case, delivering young (disk-bearing) stars into the intense radiative environment near the high-mass stars. However, we caution that the interpretation of the kinematics of stars in the ONC and the Orion A cloud is the subject of ongoing debate [109] with arguments for neither expansion or contraction [110] and multiple studies finding evidence for expansion [85, 111, 112]. The large-scale complex may be affected by larger-scale influences as suggested by the combined motion of multiple clusters associated with the region [113, 114]. Nevertheless, Orion provides a useful illustration of how large-scale multi-wavelength observations with excellent spatial and velocity resolution can be used to unpack star-forming regions as the dynamic birthplaces of planets.

4 Models

4.1 Cluster evolution

Hydrodynamical simulations provide insights into the early stages of star formation, but cannot follow the long-term evolution. Typically, simulations can either resolve brown dwarfs and binaries (e.g., [115, 116]) or high-mass stars and the impact of their feedback (e.g., [117,118,119,120]), but not both. The newest generation of simulations now include multiple forms of stellar feedback—jets, winds, radiation, supernovae (e.g., STARFORGE; [121]). These simulations model star formation in giant molecular clouds, sampling the high-mass star-forming regions in which most stars form. At the same time, they resolve individual stars, allowing the direct connection between feedback in the larger star-forming environment to its impact on individual star- and planet-forming systems. Such simulations promise to be important for understanding the dynamical evolution of clusters during gas removal due to pre-supernova feedback as well as providing initial conditions to use for higher-resolution simulations of individual planet-forming disks.

There have been a few efforts to combine long-term N-body simulations with a gas potential (e.g., [122, 123]) and with hydrodynamically evolving disks (e.g., [38]). Such efforts require adopting realistic spatial and kinematic initial conditions, but this is a challenge given the mismatch in the angular resolution of available observations of stars and gas. Wide-field surveys like the Massive Young Star-forming Complex Study in Infrared and X-ray (MYStIX; [124]) and Star Formation In Nearby Clouds (SFiNCs; [125]) provide distributions of low-mass stars on scales of \(<1^{\prime \prime }\). In contrast, wide-field surveys of molecular gas distribution in star-forming regions are typically done with single-dish telescopes with angular resolution \(\sim 30^{\prime \prime }\), 1–2 orders of magnitude more coarse than the stellar measurements. Realistic discretization for grid-based simulations is a significant challenge. Observational improvements are on the horizon with the development with large aperture telescopes like the Atacama Large Aperture Submillimeter Telescope (AtLAST; [126]) which will provide an order of magnitude improvement in survey resolution (see Sect. 8).

4.2 Cluster impact on disks

Most studies of planet formation in the cluster environment focus on the direct impact of external feedback on protoplanetary disks. With growing evidence that planet formation happens early (e.g., [127]), there is growing urgency to consider earlier stages. If planet formation is well advanced before the disk is exposed to feedback from the nearby high-mass stars, then feedback will play a much smaller role in shaping the outcome of planet formation. One of the key questions now is how quickly planets form and how soon feedback affects ongoing planet formation to determine how long, if ever, these two processes are in direct competition.

When considering these earlier stages, the coevolution of gas and dust is very important beyond its role in regulating the dynamical evolution of the cluster. Remaining cloud material shields disks from the full strength of the radiation field of nearby high-mass stars, reducing the incident radiation and prolonging the lifetime of the disk. [128] proposed this as one explanation of the two populations of disks observed in NGC 2024. Simulations of feedback in a Carina-like high-mass star-forming region from [129] find that shielding by remnant gas can protect disk for \(<0.5\) Myr but that this may be enough to allow for significant evolution of the solids, helping ensure the mass budget for terrestrial planet formation. Within 2 Myr, the majority of disks in the region are affected by external photoevaporation, demonstrating that either planet formation must happen quickly or that giant planet formation may be suppressed in high-mass star-forming regions.

Several studies have specifically modeled the survival and enrichment of planet-forming disks in dynamically evolving clusters. [130] modeled direct enrichment of planet-forming disks with short-lived radioactive isotopes from supernovae in clusters of different morphology. They find a wide variation in the level and likelihood of enrichment, suggesting that Solar-System-like levels of enrichment may be environment dependent.

In light of this tension between disk lifetime and the time for high-mass stars to evolve off the main sequence and explode as supernovae, [131] explored supernova enrichment in low-mass clusters. They find that direct enrichment in clusters with one or two \(>20\) \(\text {M}_{\odot }\) stars is as likely as in higher-mass clusters because disks are destroyed faster in higher-mass clusters. Indeed, rapid disk dispersal is a challenge. [36] showed that disks are rapidly dispersed on short timescales, with half dispersed in 2 Myr in regions with densities as low as \(\sim 10\) \(\text {M}_{\odot }\) pc\(^{-3}\). Through a series of N-body simulations sampling a range of initial densities as well as cluster structures and bulk motions, they demonstrate that the initial density is the most important parameter regulating the impact of external photoevaporation on protoplanetary disks. More disks are photoevaporated in regions undergoing collapse than those in virial equilibrium or still expanding. In lower-density regions (\(\sim 10\) \(\text {M}_{\odot }\) pc\(^{-3}\)), substructure increases the rate that disks are destroyed, although other effects dominate at higher densities (\(\gtrsim 100\) \(\text {M}_{\odot }\) pc\(^{-3}\)). They conclude that either: planet formation must happen quickly; giant planet formation will be confined to smaller radii than the orbit of Neptune; or disk evaporation rates are overestimated (in contradiction to recent models; e.g., [132]).

Photoevaporation models strongly depend on assumptions about the initial disk radii as it primarily impacts the outer disk. Material at smaller disk radii and around higher-mass stars is more tightly gravitationally bound and therefore less vulnerable to removal by external irradiation [132]. For example, we consider a 1 M\(_\odot \) star surrounded by a disk with a radius \(r_{\mathrm{disk}} = 100\) AU and a mass \(M_{\mathrm{disk}} = 0.1\) M\(_\odot \) in a \(G_0 = 10{,}000\) radiation field (common in clusters containing high-mass stars). The FRIED grid of models of externally evaporating protoplanetary disks [132] predicts a photoevaporation rate of \(3 \times 10^{-6}\) M \(_\odot \) yr\(^{-1}\), so our example disk will be destroyed within a Myr. If the initial disk radius were instead 30 AU, the FRIED grid predicts a much lower photoevaporation rate of \(4 \times 10^{-8}\) M \(_\odot \) yr \(^{-1}\), corresponding to a survival time of several Myr. In practice, the resulting disk lifetime depends on the viscosity of the disk (which may help explain why the hot inner disks also disappear more quickly in feedback-dominated regions; see Sect. 5). Nevertheless, this example illustrates that our understanding of the distribution of initial disk radii is crucial; if the initial radii are several 100s AU (as in the DSHARP survey; [133]), we would expect photoevaporation to dominate over other disk destruction mechanisms.

[134] also examined how external photoevaporation in the clustered environment constrains the timescale for planet formation in simulations that include dynamical interactions and external photoevaporation. They find that \(\sim 50\)% of disks are destroyed in \(<2\) Myr in clusters with mass densities \(\rho \sim 50\) \(\text {M}_{\odot }\) pc\(^{-3}\). At higher mass densities, \(\rho \sim 100\) \(\text {M}_{\odot }\) pc\(^{-3}\), the fraction of disks destroyed within 2 Myr rises to 80%. Follow-up work in [135] demonstrated that disk destruction depends on the density of the region, in agreement with other studies. They find an order-of-magnitude variation in disk masses with order-of-magnitude variation in region density. They demonstrate that disk masses determined from the mm continuum flux in nearby high-mass star-forming regions are broadly consistent with their models.

Studies of planet-forming disks in the context of star-forming clusters are broadly in agreement. Disks are rapidly dispersed by external photoevaporation. More disks destroyed earlier in regions with higher densities and stronger UV fields. The fraction of disks destroyed varies between studies, but most agree on \(>50\)% in \(\lesssim 2\) Myr. Together, this strongly suggests that planet formation must happen early (\(<1\) Myr).

A growing number of studies are trying to connect the influence of clustered star-forming environment with the architectures of the resulting planet populations (e.g., [136,137,138,139,140,141,142,143]). Many of these focus on dynamical interactions that sculpt already formed planetary systems. However, a few look at the impact of the cluster environment during planet formation. [136] synthesize planet populations from a range of initial conditions for the planetary system and the clustered environment. They show that background heating in the cluster environment increases the temperature in outer parts of the disk, suppressing the formation of cold Jupiters to levels consistent with observations. Disk truncation due to stellar encounters may also reduce the fraction of cold Jupiters formed in clustered regions [142]. At the moment, these simulations treat different forms of feedback in isolation. As [142] urge, future work should consider the contributions of multiple feedback mechanisms acting in concert—external photoevaporation in addition to background heating and tidal interactions.

5 Observational signatures

Some of the first and most spectacular evidence of the impact of feedback from high-mass stars on planet-forming disks were images of the photoevaporating protoplanetary disks (proplyds) in Orion from the Hubble Space Telescope (HST; Fig. 2) [144, 145]. External UV radiation from nearby high-mass stars illuminates disks, driving a neutral photoevaporative flow that is then ionized. This creates a cometary or tear-drop shape with a rounded head pointed toward the dominant ionizing source and a tail that extends in the opposite direction. For nearby regions, the characteristic tear-drop shape can be identified in diffraction-limited images. Confirmed proplyds are detected near high-mass O- and B-type stars that drive high photoevaporation rates in the disks, \(10^{-8} - 10^{-6}\) \(\text {M}_{\odot }\) yr\(^{-1}\) [146,147,148,149,150]. With these high evaporation rates, proplyds should be rapidly dissipated.

Fig. 2
figure 2

An HST image of the ONC with postage-stamp images highlighting the famous proplyds. Credit: NASA, ESA, M. Robberto (Space Telescope Science Institute/ESA), the Hubble Space Telescope Orion Treasury Project Team and L. Ricci (ESO)

There have been multiple attempts to visually identify proplyds in star-forming regions other than the ONC. Most studies use narrowband images of hydrogen recombination lines (i.e., H\(\alpha \); Paschen-\(\alpha \)) to search for the characteristic tear-drop morphology. With this approach, candidate proplyds have been identified in NGC 1977 [151], NGC 2024 [152], Cyg OB2 [153], Carina [154], and NGC 3603 [155]. Some candidate proplyds are \(20-30\times \) the size of the ONC proplyds [155], calling into question their classification. Mass estimates for a few objects derived from millimeter measurements of the gas and dust are on the order of stellar masses, not disk masses (e.g., [156, 157]). Despite their similar morphology, many candidates may instead be evaporating gaseous globules (EGGs).

A few challenges frustrate efforts to identify proplyds in more distant high-mass regions. Many proplyd candidates are located several pc from the main ionizing sources, unlike the ONC proplyds which are detected within \(\sim 0.1\) pc of \(\theta ^1\)C. True proplyds, analogous to those in the ONC, are unlikely to be spatially resolved in more distant (\(>2\) kpc) regions. In their seminal paper, [146] show how the stand-off distance, \(r_{\mathrm {IF}}\), the distance between the disk and the ionization front of the neutral wind flowing from it, depends on the incident radiation. They show that \(r_{\mathrm {IF}}\) increases with distance from the ionizing source[s] but decreases as the radiation field gets stronger. Specifically, \(r_{\mathrm {IF}} \propto \Phi ^{-1} d^2\) , where \(r_{\mathrm {IF}}\) is the stand-off distance, \(\Phi \) is the incident ionizing flux, and d is the distance from the ionizing source. Typical \(r_{\mathrm {IF}}\) sizes are \(\sim 20-200\) AU [146, 152]. At the distance of the ONC (\(\sim 400\) pc; [158]), this corresponds to an angular size of \(\sim 0.05^{\prime \prime } - 0.5^{\prime \prime }\). From more distant regions like Cyg OB2 (1.33 kpc; [159]), a comparable proplyd would have an angular size of \(\sim 0.015^{\prime \prime } - 0.15^{\prime \prime }\) which would be difficult but possible to resolve with current facilities. Expected proplyd sizes are even smaller at the distance of Carina (2.3 kpc; [160]), with ONC-like objects having sizes of \(\sim 0.009^{\prime \prime } - 0.09^{\prime \prime }\). To make matters worse, these small sizes are optimistic upper limits given that the ionizing flux, \(\Phi \), is at least an order of magnitude higher in regions like Carina [161], with a corresponding decrease in \(r_{\mathrm {IF}}\) (a photoevaporation rate of globules in Carina is estimated to be \(\sim 10^{-5} - 10^{-6}\) \(\text {M}_{\odot }\) yr\(^{-1}\) [162, 163]). Spatially resolved observations are therefore likely beyond the reach of the James Webb Space Telescope (JWST). The shortest wavelength narrowband filter is Paschen-\(\alpha \) at 1.87 \(\mu \)m, providing \(0.07^{\prime \prime }\) imaging.

While the stronger radiation field reduces the likelihood that externally photoevaporating sources will be spatially resolved, stronger fields may affect a larger fraction of sources in a given region. As a result, interest is growing in identifying signatures of external photoevaporation that do not require spatially resolving the source morphology. Directions for future work include identifying photometric and spectroscopic signposts and predicting statistical prevalence in populations of different ages.

Millimeter continuum emission traces cold dust in outer disks. Recent measurements also suggest that disk masses are lower near high-mass stars. The ONC [164,165,166] and \(\sigma \) Ori [167] both show structure in the disk mass distribution. The lowest mass disks are where the impact of external photoevaporation is strongest, within \(\sim 0.5\) pc the main ionizing source. These radially dependent masses are not universally observed, however. [168] do not observe a trend in disk masses with distance from the O8V star in NGC 2024, in contrast to the ONC and \(\sigma \) Ori. This was confirmed with recent ALMA observations from [128] who suggest that remnant cloud gas may shield disks from the harshest radiation in this young (0.5 Myr) region, modulating disk destruction. Little gas remains to shield the disks in the older (\(\sim 1\) Myr) population in NGC 2024. As in the young regions, disks in the older region do not show a structured distribution of disk masses. Fewer disks are detected in the older population and their masses are lower, perhaps reflecting the more direct irradiation from the B-type star that illuminates the region. In young regions, remnant gas may play a more important than the earlier spectral type of the star for modulating the disk population.

Structured distributions of disks have also been suggested in higher-mass regions. Without high-resolution millimeter observations to measure disk masses, these studies use the presence or absence of a disk as traced by near-IR excess emission. Excess near-IR emission compared to the photosphere indicates the presence of a hot (\(\sim 2000\) K) inner disk. Radial averages suggest a higher fraction of disks further from the high-mass stars (e.g., [169, 170]). Such clear signatures are not seen in every region, especially where the high-mass stars are more widely distributed (e.g., [171]).

There are a few reasons that a structured distribution of disks may not be the best way to infer the impact of feedback on disks. Perhaps most fundamental is that the distribution of disk-bearing sources is seen in projection. Projected distances between low- and high-mass stars may substantially underestimate the true distance. Parallax measurements from Gaia are clarifying the 3D distribution of stars in clusters. Unfortunately, precision is lowest for the faintest (lowest mass) sources, but these are the most affected by external feedback. Regions with multiple high-mass stars may not be mass segregated, so disk-destroying radiation may affect the disk from multiple directions. In substructured regions, there may be less disk photoevaporation (compared to a smooth region with similar density) because on average, more stars are further from the high-mass stars [172]. However, stars move. For a velocity of 1 km s\(^{-1}\), a star will move 1 pc in 1 Myr so a star near a high-mass star now was not necessarily there in the past and will not necessarily be there in the future. Tracing stellar motions back in time is increasingly possible using data from Gaia, but this cannot capture the full dynamical history (e.g., [173]).

External irradiation also heats disks, leading to higher temperatures that may alter the chemistry and evolution of planet-forming material. At the most basic level, accurate disk temperatures are crucial to correctly determine the disk mass [174]. A higher disk temperature translates to lower estimated mass under standard assumptions (e.g., Table 1 in [175]). Using representative temperatures will be key to enable comparisons of disk masses and demographics within and between regions. External heating may also affect ice lines [174] with high-density systems potentially heating disks to the point where the ice line disappears, suppressing giant planet formation [176].

Dynamical interactions also perturb disks with observable consequences. Models of the impact of stellar flybys [177, 178] predict warps, spirals, truncation, and increases in the accretion rate. At the moment, the best observations of the impact of tidal interactions on protoplanetary disks come from nearby binary systems, for example, the multiple flybys sculpting RW Aur [179] and tidally truncated disks in Taurus [180]. Resolving comparable features in more distant regions will be a challenge. The ONC is the nearest high-density region at a distance of \(\sim 400\) pc [158]. This is \(\sim 2{-}3\times \) the distance of disks where detailed substructures have been resolved (e.g., [133]).

Fig. 3
figure 3

External photoevaporation may reduce the disk lifetime in regions with multiple high-mass stars. Figure from [181] showing the near-IR excess fraction, indicating the presence of a warm inner disk, as a function of the region age comparing the high-mass clusters in Carina (black dots) with lower-mass clusters from [182] (stars). Credit: Preibisch, T., et al., A &A, 530, A34, 2011, reproduced with permission \(\copyright \) ESO

Going forward, we expect renewed interest in population-level studies to quantify the impact of environment. The seminal paper of [182] showed a steady decline in the near-IR excess fraction (disks) as a function of age in nearby star-forming regions. [183] suggested that disk lifetimes might be shorter in regions with many high-mass stars, although in these distant regions that signal was difficult to disentangle from the faster evolution of high-mass sources. More recent work targeting truly low-mass stars in the Carina star-forming complex supports shorter disk lifetimes in higher-mass regions. [181] measured significantly lower disk fractions in the main clusters of Carina—Tr14, Tr15, and Tr16—than in similarly aged regions without high-mass stars (Fig. 3).

This result is not universal, with other studies finding no evidence that there are fewer disks near high-mass stars [184]. Larger, more uniform surveys of disks in high-mass regions will help clarify the impact of feedback on disks (e.g., [185]). Better age estimates will also be required to accurately assess changes in disk populations and demographics with time.

6 The role of cluster dynamics in enrichment

The origin and injection of short-lived radioactive isotopes has been a subject of long-standing interest for the Solar System [186,187,188,189,190,191]. The most attention has been paid to \(^{26}\)Al and \(^{60}\)Fe due to their high abundances and short radioactive decay times (half-life of 0.717 Myr and 2.62 Myr for \(^{26}\)Al and \(^{60}\)Fe, respectively). As such, they represent a significant heat source for early geochemical evolution. Given the important role these elements play in the Solar System, there is ongoing interest in how frequently Solar-System analogs are enriched (e.g., [7, 21]).

Determining the source of these elements provides one of the strongest constraints on the birthplace of a planetary system. Both elements are produced in the late stages of the evolution of high-mass stars. Supernova explosions are one of the largest sources of both elements. Wolf–Rayet (W–R) stars may also release significant amounts of \(^{26}\)Al in their intense winds. Other sources include evolved intermediate-mass asymptotic giant branch (AGB) stars, although these are unlikely to provide enrichment to other stars in their birth cluster while planet formation is ongoing. Alternatively, \(^{26}\)Al (but not \(^{60}\)Fe) may be produced locally within the disk by cosmic-ray spallation. See [12] for a more complete review.

For outside-in models that seek to understand how the larger star-forming environment affects the outcome of planet formation, the key questions are when, where, and how long these elements are present and, crucially, whether they mix with planet-forming material. Fortunately, \(^{26}\)Al and \(^{60}\)Fe emit \(\gamma \)-ray photons when they radioactively decay, so it is in principle possible to measure their abundance and distribution. Maps of the Milky Way from the COMPTEL and INTEGRAL \(\gamma \)-ray observatories (e.g., [192,193,194]) reveal \(^{26}\)Al to be confined close to Galactic plane and bright in high-mass star-forming regions [195, 196]. The situation is more complex for \(^{60}\)Fe. It has a lower abundance, with an observed \(^{60}\)Fe/\(^{26}\)Al \(\gamma \)-ray flux ratio of 18.4% ± 4.2% [197]. The Galactic \(^{60}\)Fe signal is too weak to constrain the spatial distribution of the emission. However, a comparison with the distribution of \(^{26}\)Al and plausible morphologies suggests that the elements have a different Galactic spatial distribution [197]. More sensitive maps of both \(^{26}\)Al and \(^{60}\)Fe are goals of the Compton Spectrometer and Imager (COSI; [198, 199]) and will hopefully improve the global view.

The \(\gamma \)-ray observations indicate that the majority of Galactic \(^{26}\)Al is produced by high-mass stars. The distribution within high-mass star-forming regions is poorly constrained as the resolution of the \(\gamma \)-ray observations are \(\gtrsim 1^{\circ }\). This is unlikely to improve in the immediate future given the enormous technical challenges involved in \(\gamma \)-ray observations. Models therefore remain a key element to connect the observed large-scale distribution of \(^{26}\)Al and the likelihood it enriches planet-forming material. One under-explored avenue is to examine the abundance of \(^{26}\)Al as a function of time. Population synthesis modeling of well-studied regions like Orion and Carina suggests a high abundance of \(^{26}\)Al that is maintained for several Myr [200, 201]. For regions with multiple high-mass stars, regular replenishment may sustain high levels for much longer than the 0.717 Myr half-life. [202] point out that this could enable significantly more systems to be enriched. Models including the production and mixing of \(^{26}\)Al on the cloud (e.g., [191]) and Galactic (e.g., [203, 204]) scales will provide more realistic constraints on the number of systems that are likely to be enriched.

Recent surveys of high-mass stars also provide improved inputs for modeling efforts (e.g., [205,206,207,208]). Two key physical properties merit renewed attention. First, stellar rotation rates affect how much \(^{26}\)Al mixes into the winds from high-mass stars (e.g., [209]). Fast rotators introduce \(^{26}\)Al earlier and for lower initial masses than non-rotators. However, the observed rotation rates of high-mass stars are slow (\(\sim 100\) km s\(^{-1}\)), more consistent with non-rotating models [210]. In the context of the star- and planet-forming environment, this will affect both how early and how long \(^{26}\)Al is available in the region. A second, related consideration is that most high-mass stars are in multiple systems and the majority (\(\ge 70\)%) will eventually interact [206]. Models for binary evolution are in active development. For \(^{26}\)Al, recent work from [211] suggests that binary interactions will increase the contribution from \(\sim 10{-}15\) \(\text {M}_{\odot }\) stars but that higher-mass stars (\(\gtrsim 30\) \(\text {M}_{\odot }\)) will still dominate the production of \(^{26}\)Al. Similar work for a broader range of elements may help further constrain which environments / stellar populations are likely to lead to enrichment with \(^{26}\)Al and other crucial elements.

Additional constraints come from the observed distribution of probable \(^{26}\)Al sources. A recent study used Gaia data to determine the environment of Galactic W–R stars. Outside of the Galactic Center, only \(18{-}36\)% of Galactic W–R stars appear to be associated with clusters, associations, and/or star-forming regions [212]. The percentage of apparently isolated W–R stars is much larger than main-sequence O-type stars. [212] estimate that \(\sim 20\%\) of the isolated W–R stars could be ejected from a clustered birth region by dynamical ejection or binary disruption. These results imply that a large fraction (\(\sim 45\%\)) of the progenitor stars form in low-density associations so that they appear isolated by the time they evolve to W–R phase. Any models of W–R enrichment should include the relatively small probability that a W–R star is located close to disk-bearing stars or an appreciable quantity of star-forming gas. For example, a study that estimated the likelihood that an AGB star pollutes a molecular cloud in the Solar neighborhood found a typical probability of \(\sim 1\)% [213].

One of the main challenges with external sources providing short-lived radioactive elements to planet-forming systems is the timescale. On the one hand, stellar evolution happens comparatively slowly with a 25 \(\text {M}_{\odot }\) star taking \(>7\) Myr to evolve off the main sequence and explode as a supernova. On the other hand, evidence is growing that planet formation happens early (\(<1\) Myr; see e.g., [127]). [46] suggest considering higher-mass clusters as more massive stars will have shorter lifetimes (and the highest yields; [211, 214,215,216,217]), reducing the discrepancy in these timescales. However, increased disk destruction from the more intense radiation fields of higher-mass stars may instead exacerbate the discrepancy. Enrichment of nearby star-forming gas may provide an alternate pathway, but the timescale problem remains as this still requires an evolved high-mass star located next to star-forming gas. Pre-supernova feedback appears to rapidly clear remnant cloud material from high-mass star-forming regions (e.g., [118, 218, 219]). Sequential star formation, as outlined by [220], provides another possibility. In this case \(^{26}\)Al and \(^{60}\)Fe come from (primarily) supernova explosions from the Sco–Cen association trigger, then enrich star-forming cores in the nearby L1688 star-forming region. This scenario requires multiple supernovae—first to set star formation in motion literally and figuratively with the blastwave from one or more supernovae, followed by enrichment. Better age estimates may provide a test of this scenario as star formation directly triggered by feedback from high-mass stars is expected to lead to large (\(>10\) Myr) age differences between populations [221].

Other enrichment scenarios, such as cosmic-ray spallation, may be less sensitive to environment. Recent models propose that cosmic rays accelerated in accretion shocks provide the necessary flux of energetic particles to produce \(^{26}\)Al locally [222]. In this case, more environments may be able to produce \(^{26}\)Al-rich systems. An environmental dependence may persist if external cosmic rays contribute significantly to \(^{26}\)Al production. High-mass stars themselves may be a significant source of cosmic rays [223]. Stellar winds and supernova explosions interacting with the surrounding medium may enhance the local cosmic-ray flux [224, 225]. Models including both internal and external cosmic-ray sources would be interesting to explore.

As an aside, we note that an enhanced cosmic-ray flux in high-mass star-forming regions would be interesting to explore more generally. Cosmic rays play an important role in planet-forming disks beyond element synthesis [226]. Cosmic rays are the only source of ionization deep in molecular clouds [227] and disk midplanes and thus enable chemistry in planet-forming disks [228,229,230]. Changes in the ionization level due to different cosmic-ray fluxes affect magnetically mediated processes like accretion and outflow that require some ionization to couple gas to the magnetic field (e.g., [231]). Higher ionization may enhance magnetic braking which [232] propose as an alternate explanation of the smaller disk sizes observed in high-mass star-forming regions.

7 Connecting birthplaces to planet properties

As discussed in the previous section, short-lived radioactive isotopes provide some archeological evidence for the birthplace of the Solar System and clues as to how the birth environment might affect exoplanet demographics. With the enhanced view of Galactic dynamics provided by Gaia, some groups have also tried to back out the birth environment of the Solar System dynamically. This includes work to reconstruct dissolved clusters to identify birth location and cluster origin of the Solar System [233]. The Galactic open cluster M67 has been proposed as an analog of the Solar System birthplace because of its similar age and metallicity. The cluster is sufficiently massive to have produced high-mass stars to provide enrichment with short-lived radioactive isotopes. [234] determine that stars escaping from M67 can reach Solar-like orbits. Most clusters do not survive for Gyr, so an origin in a surviving cluster like M67 would point to an unusual birthplace for the Solar System compared to most stars. External photoevaporation from high-mass stars in the cluster (that also provided short-lived radioactive isotopes) may have prevented the formation of Jupiter and Saturn.

Direct observations of planets in their birth clusters remain challenging with only a few confirmed detections of accreting protoplanets (i.e., PDS 70; [14, 15]). Planets have been detected orbiting stars in young (\(>100\) Myr) open clusters providing important age constraints for models [16, 235]. Planets in longer-lived \(\sim \) Gyr systems like NGC 6811 [236] provide an important test of how dynamical evolution shapes the architecture of planetary systems. [236] find that the statistical population of planets in open clusters like NGC 6811 is indistinguishable from the field. More recent work has argued for the dynamical creation of hot Jupiters in star clusters [237] with some observational evidence for an increased incidence in high-density stellar clusters [238]. These studies provide an important link to the observed demographics of exoplanets, but may be dominated by later dynamical processing that shapes planetary architectures post-formation.

The majority of exoplanets are not found in clusters. There is a great deal of interest in inferring their birth environments. Such forensic studies are more difficult than for the Solar System where detailed meteoritics provide stringent constraints. Some efforts have been made to connect exoplanet properties with the impact of their current and past environments. A series of papers beginning with [32] use the local position–velocity phase-space density to distinguish between exoplanet hosts in ‘over-density’ and ‘field’ environments. This analysis finds statistically significant differences in the occurrence rate of hot Jupiters in the field and over-density populations [32]. These statistical differences have also been used to argue that stellar clustering influences the planet radius valley [239], planetary multiplicity and orbital periods [240], and planet properties in multiple systems [241]. These results have inspired a lively debate in the literature about whether age and metallicity bias the correlation [242, 243].

The present-day phase-space density does not necessarily reflect the birth environment. A recent contribution argues that the over-density regions are the result of large-scale (Galactic-scale) perturbations [244]. Questions remain about the physical mechanism that drives the observed correlation. Densities in the ‘over-density’ regions are much lower than in star-forming clusters, with a correspondingly low encounter rate (see discussion in [244]). Interpreting the identified ‘over-density’ and ‘field’ regions in the context of known moving groups may help identify the relevant physical mechanism. Finally, a similar study investigating the demographics of stellar multiples in over-density and field regions would provide a larger statistical sample to investigate how phase-space overdensities affect the architectures of multiple systems.

8 Future directions and future facilities

In the coming decade, large-scale surveys from multi-object spectroscopic facilities will provide observations of more stars in more environments. The new 4-meter Multi-Object Spectroscopic Telescope (4MOST) [245] will be able to observe \(\sim 2400\) objects over a 4.2\(^{\circ }\) hexagonal field of view (FOV) and provide both moderate-resolution (\(R \approx 4000{-}7500\)) and high-resolution (\(R \approx 18{,}000{-}21{,}000\)) spectra. Science goals for the project include measuring radial velocities with a precision of \(<2\) km s\(^{-1}\) for Gaia sources (\({G_{\mathrm {Vega}}} < 20.5\) mag). Multiple community surveys with 4MOST target young stars and star-forming regions to understand the formation, evolution, and dissolution of star clusters. 4MOST is expected to start operations in early 2024.

The Multi-object Optical and Near-IR spectrograph (MOONS; [246]) will be installed on the 8-m Very Large Telescope (VLT) of the European Southern Observatory (ESO). MOONS makes use of the full 25\(^{\prime }\) diameter FOV of the VLT, deploying 1001 fibers to obtain high-resolution spectra (\(R \approx 20{,}000\)) with simultaneous optical and near-IR (\(0.6{-}1.8\) \(\upmu \)m) coverage. High-resolution coverage in the H-band will allow these observations to probe obscured regions, extending the work done at similar spectral resolution in nearby regions (e.g., by the IN-SYNC survey, [111, 247,248,249,250]) to regions more distant and more massive than Orion. Probing earlier stages in the dynamical evolution of star clusters will help constrain models.

Full 6D measurements of fainter stars will be possible with additional data releases from the ongoing Gaia mission [251,252,253]. Gaia will also continue to enable the discovery of previously unknown star clusters [254]. The current Gaia optical mission has enabled us to assess the kinematics of exposed (gas-free) clusters and also quantified the numbers of runaway (\(>30\) km s\(^{-1}\)) and walkaway (\(>5\) km s\(^{-1}\)) stars on the outskirts of star-forming regions [255, 256]. In the future, a near-IR version of Gaia would facilitate studies of the kinematics of star-forming regions at much earlier ages. Targeted studies of star-forming regions within 500 pc from the VISTA Star Formation Atlas (VISIONS; [257]) survey will provide multi-epoch near-IR imaging to enable proper motion measurements. Similar studies of more distant regions may be possible in the future with the wide field of view and sensitivity of the Roman Space Telescope.

The recent launch of JWST promises a wealth of near- and mid-IR observations of planet-forming disks. For Galactic high-mass star-forming regions beyond Orion, the angular resolution at these wavelengths will probably not suffice to spatially resolve proplyds, as discussed in Sect. 5. An additional complication for imaging studies is that the bright and variable nebulosity characteristic of these regions is likely to saturate in nearby regions. However, JWST will enable the first detection of low-mass stars in more distant regions, including those with lower metallicity. Spectroscopy will be especially powerful for probing planet-forming disks affected by feedback, quantifying its impact on the disk structure, chemistry, and mineralogy (e.g., GO 1759, GO 1905).

Wide-field surveys of molecular gas with high angular resolution will be possible with future large single-dish telescopes like AtLAST [126]. The proposed 50-m facility will provide \(\sim 5^{\prime \prime }\) angular resolution at 1 mm, a factor of \(\sim 6\) better than the current best-available survey data. The corresponding spatial resolution at the distance of Taurus (140 pc) is \(3 \times 10^{-3}\) pc or 700 AU; for Orion (400 pc), 0.01 pc or 2000 AU; and for Carina (2300 pc), 0.06 pc or 11,500 AU. Millimeter observations with angular resolution approaching that achieved at shorter wavelengths will enable more studies of the direct impact of feedback on the structure and kinematics of star-forming gas (e.g., [258]). More comparable resolution for maps of the gas and stellar distributions in young clusters will also significantly improve inputs for simulations of their coevolution (e.g., [123]).

The COSI [198] mission, recently selected through NASA’s astrophysics explorers program, will survey the sky in the soft \(\gamma \)-ray regime (\(0.2-5\) MeV). COSI is a factor of \(30-100\) more sensitive than the earlier generation of X-ray observatories with high spectral resolution to provide the first map of \(^{60}\)Fe (1.173 MeV and 1.333 MeV) in the Galaxy and isolate \(^{26}\)Al (1.809 MeV) from individual massive star clusters. The mission is currently in Phase B development with an expected launch in late 2025.

Finally, multiple 30-m class telescopes (extremely large telescopes; ELTs) are under construction with first light expected at the end of the decade. The transformative sensitivity and angular resolution ELTs provide the first resolved observations of individual objects in regions too distant for high spectral and/or spatial resolution observations with current facilities. High-mass star-forming regions (with typical distances \(\gtrsim 2\)  kpc) will be prime targets for these facilities, providing direct tests of the impact of more intense feedback from multiple high-mass stars.

The improvement in angular resolution and sensitivity provided by upcoming facilities will unlock the study of low-mass stars in high-mass star-forming regions, sampling a broader range of feedback conditions. Many of these facilities are IR-optimized, opening up younger regions for detailed scrutiny. This is critical to determine how quickly feedback affects planet-forming disks. Finally, connecting the dots from feedback in the environment during planet formation and the observed demographics of exoplanets will require creatively combining disparate observations.

For example, as discussed in Sect. 7, exoplanet host stars are nominally not traceable to a star-forming region, and thus, we have very little information on the environment in which that host star was born. The recent phase-space study of exoplanet host stars by [32] postulated that the dynamically ‘hot’ planetary systems correlated with dense phase space of the host star. While the interpretation that this is a relic of the birth environment is disputed [244], other planet properties may provide additional insight. A further novel way of constraining the birth environment of exoplanet hosts would be to accurately determine the radii of transiting terrestrial planets (e.g., with PLATO). Models with heating from short-lived radioactive isotopes (i.e., \(^{26}\)Al) predict that enriched systems have smaller radii [17]. For two planets of comparable composition, a planet without heating from short-lived radioactive isotopes would have a larger radius as a result of its higher volatile content. If planets forming in high-mass star-forming regions are more likely to be enriched with \(^{26}\)Al [202], this may provide another indirect indication of the birth environment.

For numerical simulations, environmental studies urgently need a critical appraisal of how effective gas/dust is at shielding disks from photoionizing radiation. The high dust/gas content in star-forming regions may shield disks from evaporation, but this depends on how quickly the high-mass stars blow huge cavities—as in the simulations of [117, 118]—that rapidly leave the disks exposed to photoionizing radiation. Spatially resolved observations of extinction in different clusters as a function of their age, size, and membership will be a key calibration of these models. Another crucial piece of the puzzle will be whether we can accurately determine the true effects of feedback on protoplanetary disks. Photoevaporation affects both the gas [132] and dust [259] component of disks, but models do not yet include both effects. Extending models of external photoevaporation by, e.g., [132], to maximum FUV fields \(>10^4\) G\(_0\) (the current limit) will also aid models of the time-evolution of star-forming regions like the ONC that likely had much higher initial FUV radiation fields [260].

9 Conclusions

Studies of star-forming clusters as the birthplace of planets are experiencing a renaissance as more exoplanets are discovered, and new facilities provide high-quality, large-scale survey data of star-forming regions. Most stars form in regions with higher densities than the Galactic field and will be affected by their stellar neighbors during the epoch of planet formation. Density is the strongest determinant of the impact of feedback. Dynamical interactions require high densities, while external photoevaporation of planet-forming disks is important in both high- and low-density regions. How density evolves with time determines when and how long different feedback mechanisms affect disks, and thus disk lifetimes in the region as a whole.

Models are in broad agreement that feedback rapidly disperses disks. Observational confirmation is more challenging. Some studies show a clear spatial structure with fewer and/or smaller and lower-mass disks near high-mass stars. Others do not, perhaps reflecting observational biases (projection effects), mixing of disk-bearing and disk-less sources by kinematics, and/or highly structured fields of radiation created by multiple high-mass stars. Recent work also suggests that star-forming regions may have spatially structured age differences, with younger sources concentrated toward the center. This may explain why disks are observed in regions with strong UV fields that should rapidly disperse them. If common, spatially structured age distributions will affect the apparent disk lifetime in feedback-dominated regions. While individual disks may be short-lived, disks may be present in the region for longer than the individual disk lifetime.

High-mass star-forming regions include thousands of low-mass stars. These are prime environments to quantify the impact of feedback but have historically been a challenge to observe given their typical distances of \(\gtrsim 2\) kpc. As a result, there is a growing interest in observational signatures of feedback on disks that does not require spatially resolved images. Surveys of disk demographics in truly high-mass regions would be a helpful starting point, although to date, there have only been a few targeted studies of millimeter dust emission from disks in regions with high-mass stars. In the future, observations with ELTs may provide resolved observations to directly test a few sources.

Models of star-forming clusters now include multiple forms of feedback. These will provide better initial conditions for disk models. Higher-resolution observations of the distribution of stars and gas in young regions will provide better inputs for N-body simulations coupled to a live background gas potential.

How frequently planet-forming systems are enriched with short-lived radioactive isotopes is of great interest given the important role these elements play in geochemical evolution. Galactic-scale observations reveal high abundances of elements like \(^{26}\)Al in high-mass star-forming regions with dying high-mass stars as their likely source. Whether these elements are available early enough and mix efficiently with planet-forming material remains an open question. Recent observations of high-mass stars provide some additional constraints for enrichment models. For example, Galactic W–R stars (expected to be one of the primary sources of \(^{26}\)Al) are infrequently found in star-forming regions. The observed rotation rates of high-mass stars are much slower than previously assumed in models, affecting when, how long, and from which stars \(^{26}\)Al may be introduced into the local star-forming environment. These and other challenges have brought renewed interest to local \(^{26}\)Al production by cosmic rays. High-mass stars themselves may be a significant source of cosmic rays and it would be interesting to consider this in models of \(^{26}\)Al production and indeed their impact on star-forming regions more generally.

We highlight upcoming facilities that will provide much-needed observational constraints. Large-scale spectroscopic surveys will provide radial velocities, complementing the wealth of proper motion information from Gaia. These data will also provide stellar parameters for thousands of young stars. New spectroscopic and imaging capabilities in the near-IR will be especially impactful as they will open up younger and more embedded regions, providing better constraints on the initial conditions in star-forming clusters.